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CRITICAL INCIDENTS REFLECTING OUTSTANDINGLY EFFECTIVE 
BEHAVIOR IN STUDENT TEACHING OF HOME ECONOMICS WERE COLLECTED 
FROM COLLEGE SUPERVISORS i COOPERATING TEACHERS i AND STUDENT 
TEACHERS AND CATEGORIZED ACCORDING TO SPECIFIC BEHAVIORSi AND 
PROVIDED A BASIS FOR CONSTRUCTING A RATING SCALE FOR 
EVALUATING STUDENT TEACHING PERFORMANCE. THE CRITICAL 
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FIVE-POINT CONTINUUM OF 112 UNI DIMENSIONAL ITEMS. ANALYSIS OF 
THIS SCALE I RS-112i BY THE DARLINGTON PROCEDURE WITH STUDENT 
GRADES AS CRITERION MEASURES RESULTED IN A SCALE, RS-35, 
35-ITEM WITH AT LEAST ONE ITEM IN EACH OF 10 MAJOR BEHAVIOR 
CATEGORIES. USE OF THE RS-35 GAVE A BETTER DISTRIBUTION OF 
RATINGS THAN RS-112. THE MAJORITY OF COOPERATING TEACHERS AND 
COLLEGE SUPERVISORS REPORTED USING A RATING SCALE. ABOUT HALF 
PREFERRED RS-35 TO RS-112 AND OVER HALF LIKED IT BETTER THAN 
THEIR PRESENT INSTRUMENT. MOST REPORTED A 20-MINUTE TESTING 
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USING THE FIVE ITEMS EMERGING FROM MULTIPLE REGRESSION 
ANALYSIS FOR A SHORTER RATING SCALE, (4) INCLUDING A PLACE 
FOR RATER COMMENTS ABOUT STUDENT TEACHER GROWTH, AND <5) 

USING THE SCALE AT THE MIDDLE AND CONCLUSION OF STUDENT 
TEACHING. BOTH SCALES, OPINIONNAIRES, CATEGORIES OF STUDENT 
BEHAVIOR, AND THE INFORMATION FORM FOR REPORTING CRITICAL 
INCIDENTS ARE INCLUDED. (FP) 
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INTRODUCTION 



The student teaching experience affords the opportunity to 
detvelop competencies inherent in effective teaching. The pro- 
fessional supervision of student teachers is an attempt to give 
direction to this experience. Evaluation of student teacher per- 
fomance, i.e. the student teaching grade, provides an index of 
effective teaching and, to this extent, evaluation can be used to 
clariiy objectives and improve role competency. A valid and re- 
liable evaluation device contributes to the efficacy of assessment 
and therefore to the fulfilment of student teaching goals. Current 
literature reveals that there is a dearth of such instruments and 
that many educators eare dissatisfied with present patterns of 
evaluation of student teaching. 

Purpose and Nature of the Research 

In an attempt to facilitate the process of student teacher 
evaluation in home economics, a study was designed to determine 
by means of the critical incident technique the performances 
contributing to effective and ineffective student teaching be- 
havior, and to construct a rating scale to evaluate those per- 
formances. The critical incident technique employs collected 
reports of effective and ineffective behavioral patterns as 
they relate to the aim of the activity being analyzed. The 
present study used this means to identify student teacher be- 
haviors related to and reflexive in effective or ineffective 
teaching. Based on the categorized incidents, descriptions of 
unidimensional student teacher behavior were written and assembled 
in rating scale form. It was expected that the rating scale 
would thus describe student teacher behaviors which are signifi- 
cant in the performeuice of the student teacher. 

Following independent use of the rating scale by cooper- 
ating teachers, student teachers cud college su}..ervisors , analy- 
sis of the data was executed using the student teaching grade 
as the criterion measure of student teaching performance. The 
theoretical work of Thorndike (30) and the implementation of 
this by Darlington (lb) provided the base for the test analysis. 
Regression analysis predicting the student teaching grade from 
the rating scale items was performed. 

A second stage rating scale composed of those items from 
the preliminary scale statistically determined to be the most 
iog)ortant indicators of student teaching performeuice , was formu- 
lated and administered to a second sample for purposes of cross 
validation. This instrument was tested and its reliability and 
usefulness as an evaluative device in the student teaching ex- 
perience were assessed. The reliability of the instrument was 



determined by inter rater » split»half, and item discrimination 
methods. The usefulness of the rating scale was investigated 
through administration of two opinionnaires , one designed for 
student teachers, euid one for cooperating teachers and college 
supervisors. 

The study was limited to evaluation of student teachers in 
home economics, and evaluation was restricted to the use of a 
rating scale at the end of the student teaching experience* 
Operational definitions of terms used in this report may be 
found in the glosscury. 

Related Research 

In reviewing literature relevant to the topic of this study, 
five areas were covered: (l) evaluation of student teaching in 

home economics, (2) evaluation of student teaching in general, 

(3) teacher effectiveness, {k) the critical incident technique, 
and (3) rating scales. 

Student Teaching in Home Economics . Studies of evaluation 
of student teaching in home economics have centered on surveying 
practices and instruments used to assess the effectiveness of 
student teaching (10) and the effect of such upon the student 
teacher (27), construction of scales based on the opinions of 
qualified teacher educators or cooperating teachers as to the 
aspects of teaching necessary to measure (33), and evaluation 
of the performance of instruments (13). 

Student Teaching . In the literature on student teaching in 
general, there is considerable evidence of dissatisfaction with 
methods of evaluation (28; 11 ) and some attention to identifica- 
tion of needs in the area of evaluation (21). Ancilyses of ex- 
isting rating scales for the evaluation of student teaching 
been carried out (5; 3^)* Personality characteristics of infer- 
ior and superior student teachers were delineated (32; 29; 23). 
Student teachers were evaluated in terms of strengths and weak- 
nesses (26), and a study was conducted inquiring as to what 
school administrators desired to know about a prospective teacher 
employee (25). 

Teacher Effectiveness . In reviewing literature on teacher 
effectiveness three basic approaches to its measurement were 
identified: evaluation of pupil growth, evaluation of teacher 

qualities thought to influence the teaching-learning process, 
and appraisal of the teaching process. The measurement of 
teacher effectiveness has been plagued by the problem of 
validity of the criterion of teaching effectiveness. Seme 
researchers feel that consensus of authorities in the field 



is the test procedure (3) while others are investigating objec- 
tive measures (2; 13). As yet, none of the objective means seems 
to have value for appraisal of student teaching. Measurement of 
teacher qualities thought to Influence the teaching— learning pro- 
cess has usually taken the form of a rating scale formulated by 
teacher educators (l; 35; 19). Appraisal of the teaching process 
has involved measurement of classroom behavior quantitatively 
and by rating devices (6; 22). The teaching process seems most 
readily evaluated through the analysis of teaching behavior since 
behaviors can be observed and personality as it enters behavior 
can also be assessed (22; 8). Results of these studies in general 
have yielded inconclusive evidence about teacher effectiveness. 

Crit ical Incident Technique t The critical Incident techni- 
que involves the identification of behaviors that are either out- 
standingly effective or ineffective as the individual performs 
the Job under scrutiny. This methodology has been used exten- 
sively in the analysis of many Jobs and provides information re- 
garding the requirements of a position (l6). Although Flanagan 
suggested the technique requires only simple Judgments on the 
part of the qualified observer, and thus is easy to use, Mayhev 
(20) pointed to some difficulties in using the critical incident 
technique. Advantages of this method cure its provision of actual 
materials for use, and valid bases for construction of evaluation 
instruments. A study by Blank (9) employed the critical incident 
technique to detennine characteristics of effective and ineffec- 
tive teaching of physical education instructors. 

Rating Scales. The rating scale classified as graphic was 
used in this study. Such a scale consists of a continuum pre- 
sented to the subject with preassigned bench marks, to be marked 
according to the rating deemed most accurate. This form of rat- 
ing scale is felt to be simple and easy to administer, interest- 
ing and quick to work with U7). Criticism of rating scales have 
Included i^ack of satisfactory validity criteria, subjectivity, 
and difficulty of establishing the validity of the totel rating. 
Use of critical incident material based on student teacher's ob- 
served behavior took care of the validity criticism; subjectiv- 
ity was handled by attempting to define as precisely as possible 
the dimension of the student teaching experience to be rated; 
validity of the total rating was investigated by correlating tbe 
summed scale scores with student teaching grade. In that a rat- 
ing scale l3 geared to standardize both what is observed and how 
it is appraised, it is felt that this is the most useful instru- 
ment for securing a quantitative appraisal of student teaching 
(*♦). 



Because student teaching is asserted to be the most important 
experience in teacher preparation, the scarcity of literature on 
the evaluation of student teaching was unexpected. The problem 
of evaluating student teacher performance is complicated by the 
fact that researchers have not reached definitive conclusions 
about teacher effectiveness. Rating scales appear to be the best 
method of evaluating student teachers* performance, and the 
criticeQ. incident technique seems to hold promise for indicating 
the behaviors of the student teacher that should be evaluated in 
this manner. 



METHOD 

The major problem with which this study was poncerned was the 
construction of a rating scale to serve as a summary evaluation of 
home economics student teachers* performance in student teaching. 
Flanagan *s critical incident technique was utilized as the means 
by which data on actual effective and ineffective behaviors of 
student teachers were secured and thereafter utilized as a valid 
base for the construction of the scale. 

Sample and First Data Collection . Colleges and universities 
preparing home economics teachers in New York State, the New Eng- 
land States, Pennsylvania, New Jersey, Ohio, Indiana, Tennessee, 
Kentucky, and West Virginia were asked to cooperate in this re- 
search. (See appendix for list of participating institutions and 
letter to participants.) Cooperating teachers, college supervisors 
and student teachers of 20 Institutions participated in the collec- 
tion of critical incidents during May 196^, and from October 1963 
through February 1966. (Materials used in the collection of criti- 
cal incidents are appended.) 

Analysis of Critical Incidents. All critical incidents re- 
ceived were analyzed to separate out single specific behaviors; 
the behaviors were then classified into ten major categories that 
emerged as the Incidents were studied, A randomly selected 15 
percent of the incidents were submitted to a panel of five heme 
economics teacher educators who classified them according to the 
ten categories. The majority €igreement of the consultants was 
used as the accepted categorization for the incidents reviewed. 

(The appendix contains a list of the categories of student teacher 
behavior derived from the critical incidents and an example of the 
analysis of an incident.) 

Construction. Use and Analysis of Preliminary Rating Scale . 
Following analysis and categorization, the specific behaviors were 
utilized in the construction of a rating scale. Ineffective behav- 
iors reported formed the level one description on the scale, the 



effective behaviors reported were utilized in the level five descrip- 
tion whenever possible. The level three descriptions were written 
so as to strike a middle of the road position between levels one 
and five. Based on the critical incident data received, 112 rating 
scale dimensions emerged to form the first stage rating scale - 
hereafter referred to as RS-112. Content validity is based on the 
330 critical incidents of student teachers' behaviors. Search was 
made for instruments to establish concurrent validity of rating scale 
items but satisfactory instruments were found not to be available. 

Twenty institutions (not the identical 20 furnishing critical 
incidents) collected data frcm the use of the first-stage rating 
scale by cooperating teachers, college supervisors and student 
teachers. Reliability of RS-112 based on 399 scales was determined 
in three ways: (1) computation of the correlation of all matching 

college supervisor-cooperating teacher ratings (N«125 pairs), i.e., 
inter-rater reliability, (2) correlation of the sum of the even 
scores with the sum of the odd scores, i.e., split-half reliabil- 
ity of the grader group (N«19^ persons responsible for grading 
student teachers) and (3) computation of the index of discrimi- 
nating power of each of the items. 

The Pearson product moment correlation was chosen for those 
ancdyses involving correlations. Rho was considered and discarded. 
The basis for this decision was that it was easier to mako the 
assumptlcn i;hat the Intervals in the data (rating scale and student 
teaching grades) were equally spaced than that the distance between 
ranks was equal. The median correlation was used to compare groups 
of correlations with each other. 

The student teaching grade was used as a criterion measure of 
success in student teaching. Standardization of student teaching 
grades was Implemented by a z»score conversion employing mean scores 
on the rating scale and mean student teaching grades for each ins- 
titution and the total sample. 

Production. Use and Analysis of Second Stage Rating Scale . In 
order to ascertain the items identifying those teacher behaviors 
most predictive of student teaching success, the second stage rating 
scale was constructed. Darlington's method (lU) based on the work 
of Thorndike (30) was utilized to reduce the number of items in 
RS-112 to the "best" predictors of the criterion. The Darlington 
method yields information about the ability of an item to increase 
the validity of a test as determined by item ^:alidity and item^test 
correlation. The procedure involves (l) selecting the most valid 
items to form the basic test, (2) measuring the potential cf each 
item in the pool to increase the validity of the basic test, and 
(3) adding several items with the highest potential to detexmine if 
in fact they do increase the validity of the test. The validity 



coefficient it a multiple correlation of the items and the criterion. 
The Darlli^on method yeilds high validity coefficients upon cross 
valldavion and vith lets shrinkage of items than is usual for the 
method of multiple regression analysis. By nature of its fimction, 
multiple regression analysis picks the "best" items ffom a highly 
correlated pool» and thereby disregards other highly inter-corre- 
lated items in the pool. With the Darlington method, those other 
highly inter-correlated items in the pool still have a chance of 
coming into the test and contributing something to its validity. 
Although this method vas the chief analysis of the study, a regres- 
sion analysis vas also executed. Thirty-five items emerged from 
the Darlington analysis; four additional items, as veil as 10 
contained in the 3$ items above, came from the regression analysis 
as significant for the evaluation of student teaching in home 
econosdcs. These constituted the revised or second-stage rating 
scale. 

Data frosi administration of the second-stage rating scale in 
participating institutions vere analysed using the procedure folloved 
for RS-112 vith the exception of the Darlington and multiple regres- 
sion analyses. The final form of the Darlington program for RS-112 
(the 3$ items as ordered by the partial correlation and validity pro- 
grams) vas used in the analysis of the second stage rating scale, 
and the ik items emerging from regression analysis vere utilized in 
the analysis of the second stage rating scale. The detemination of 
the significance of the difference of correlations obtained vith 
RS-112 data and those of RS-3$ data vas faicllltated by use of 
Fisher's s-test. In addition the scores on the 3^ items vere sunmed 
and the sum vas correlated vith the student teaching grade. 

Tvo oplnlonnaires vere formulated, one to determine feelings of 
student teachers and the other the feelings of cooperating teachers 
and college supervisors tovard the rating scale. The opinions de- 
sired related to comparison vith evaluation devlvces wesently in 
use, and time required to fill out the instrument. \8ee appendix 
for copies of oplnlonnaires.) 



RESULTS 

Critical Incidents . Of the $63 critical incidents collected, 
330 vere usable. Nearly half vere reported by cooperating teachers, 
student teachers submitted about one-third, and college supervisors 
about one-fifth. Student teachers reported approximately equal num- 
bers of effective and ineffective incidents, vhlle both cooperating 
teachers and college supervisors contributed more ineffective than 
effective incidents; slightly less than half of the reports in the 
total sample vere made of effective behavior. 






study of the ^50 incidents revealed 9^8 specific behaviors. 
Classification according to behaviorail areas of a randomly selected 
1^ per cent of -^he Incidents by a panel of home economics teacher 
educators was largely in agreement with the authors' classifications 
of specific behaviors. 

First Stage Rating Scales . The critical behaviors formed the 
basis for construction of a rating scale for evaluation of student 
teachers in home economics, initially consisting of 112 items. Five 
hundred, ninety-tvo rating scales were distributed for administration 
at the end of the student teaching period. (The Appendix contains 
a copy of RS-112 with inter-rater reliability, correlation with stu- 
dent teaching grade, and index of discrimination figures for each 
item noted in the margins.) Of the returned, 399 (68|K of rating 
scales sent) were usable. Student teachers completed 12U of these, 
cooperating teachers 139 » ond college supervisors completed 136. 
Student teachers tended to rate themselves higher than their cooper- 
ating teachers and college supervisors rated them. Cooperating 
teachers made more use of all levels of the rating scale than did 
either college supervisors or student teachers. The upper end of the 
Instrument received greatest usage with 33 per cent in each of the top 
two catqsorles (U and 3). Approximately 20 per cent of all responses 
were found in level 3. About 3 per cent of the ratings were either 
"no observation" or "no answer". 

Reliability of the 112 i = em instrument was ascertained by 
inter-rater, split-half and item discrimination methods. Inter-rater 
reliability on the rating scale was computed on data from 123 cases 
of matching college supervisor and cooperating teacher. The coeffi- 
cient of reliability for all variables was .U2, but when stepped-up 
by the Spearman Brown prophecy formula, rose to .39* The split-half 
method of detemlnlng Internal consistency was computed on 19U 
graders' date; the coefficient was .97* When stepped-up by the 
Speaxman Brown prophecy formula, this correlation became .98. Cross 
veJ.ldatlon was done by dividing the entire group alternately in 
half. The cross validation split-half reliability for Graders A 
(Ns 97) was .96 and for Graders B (N»97) was .96. The cross validation 
shows that the coefficient of split-half reliability can be accepted 
with confidence. 

The Internal consistency of the first stage rating scale was 
also tested by the use of the Index of Discrimination. The statistic 
showed the amount of differentiation of the 112 ratings between the 
high and low 2'f percent of the student teachers as determined by stu- 
dent teaching grade when the grader data (N-19^) were analyzed. The 
mode for the Indices of discrimination (71 percent) fell in the 
.20 to .29 range, low but acceptable. There were 21 items with in- 
dices below .20 putting them in the questionable class. Despite the 
generally low discrimination values, all these indices were positive. 



The major analysis of RS-H2 utilized the Darlington procedure 
to determine the items significant for the evaluation of student 
teaching in hone economics. This analysis used vith student grades 
adjusted on the basis of mean ratings of institution in idiich th^ 
occurred resulted in a 35 item scale having a validity coefficient 
(multiple correlation vith criterion) of .85. The same procedure 
using student teacher grades as given, yielded a validity coeffi- 
cient of .83. The items identified vith one asterisk on the 112 
item rating scale in the appendix are the 35 items resulting from 
the Darlington analysis. 

A stepvise multiple regression analysis was also carried out 
on RS-ll2. A ik itan scale vith a multiple correlation of .87 re- 
sulted. All of the ik items had F levels significant at .01. Ten 
of these items had also been derived in the Darlington analysis. 

The items on the appended 112 item scale marked vith tvo asterisks 
vere those selected by the regression analysis. 

Second Stage Rating Scale. Second stage rating scales (389) 
vere distributed (to be used vith nev student teachers even though 
seme of the same Institutions participated). Sy the cut-off dead- 
line, 276 (71 percent) of these had been completed and returned. 
(The Appendix contains a copy of RS-35 vith inter-rater reliability 
correlation vith student teaching grade cuid index of discrimination 
figures for each item noted in the margins.) Of the total, 91 vere 
completed by student teachers, 99 by cooperating teachers and 86 by 
college supervisors. Again the upper levels vere used more heavily 
vith 28 percent of the ratings falling at the 5 level, 35 percent 
at ^ level and 26 percent at level 3. Although levels 1 and 2 vere 
still not used extensively, a better balance of levels 3, ^ and 5 
vas achieved vith the second stage rating scale. 

Reliability of the second stage rating scale vas determined by 
Inter-reter, split-half, and item discrimination methods. Correla- 
tion of 61 matching cases of college supervisor-cooperating teacher 
data yielded a coefficient of .U8, as compared vith the inter-rater 
reliability correlation coefficient of .38 for RS-112. When the 
inter-rater correlation of .i»8 for RS-35 vas stepped-up by use of 
the Spearman-Brovn prophecy formula, the coefficient rose to .65 
vs. the stepped-up coefficient of .59 for RS-112. Hovever, the 
differences betveen these correlations vere not statistically sig- 
nificant vhen Fisher's z-test vas applied. The inter-rater reli- 
ability coefficients for each of the items in RS-*35 are noted on 
the scale appended. 

Execution of split-half reliability yielded a coefficient of 
.93 vhen carried out on the 122 graders' data. The split-half 
coefficient for RS-112 vas .97. Whei? Fisher's z-test vas applied 
to determine the significance of the difference, the results vere 



found to be significant at the one percent level* When the spilt 
half reliability coefficient of RS-33 vas stepped-up with the 
Speaman-Brovn formula » the coefficient became *96 idilch vas still 
significantly different at the one percent level from the stepped- 
up coefficient of *98 resulting from RS-112. 

Cross validation of RS-33 reliability vas carried out by di- 
viding the grader group alternately in half. The coefficient for 
Graders AA (^N«6l) vas ,88 and that for Graders BB (N*61) vas .93. 
Utilizing Fisher's z-xcst, comparison of split-half reliability 
coefficients for Graders Ak and Graders BB vith all Graders on 
RS-3$ data yielded non-significant differences from the overall 
split-half reliability correlation of «93. The internal cross 
validation of RS-3$ grader data indicate that the split-half re- 
liability coefficient can be accepted vith confidence. 

The Index of Discrimination vas computed for the 3$ items in 
the second stage rating Beale plus the additional k from the re- 
gression analysis using the high and lov 27 percent of the student 
teachers as detemlned by student teaching cprade. The mode for the 
cross validation data lay in the same *20 - .29 range as vith RS-^ 

112, but vith RS-33» 60 percent of the indices vere in the *20 - 
•29 range vhereas vith RS-112, 71 percent vere in that range. The 
•30 - .39 range nov held 37 percent of the indices as compared vith 
RS-112 vhere only 10 percent of the indices vere in that category. 

In RS-33 only one item (3 percent) had an index belov .20 and thus 
vas questionable compared vith 21 items (19 percent) on RS-112. 

The findings shov that the items on RS-33 had better discriminating 
pover than those on RS-112. As vith the RS-112 data, all discrim- 
ination Indices, although lov, vere positive. 

The comparison of the student teacher's perception of herself 
vith ratings of college supervisor? and of cooperating teachers for 
RS-S? data yielded correlations pf .26 (N«60 matching pairs) and 
•21 (Re70 matching pairs) respectively. The conqparable figures for 
RS-112 vere .22 and .29; there vm no significant difference betveen 
the correlations resulting from the tvo data collections. 

The validity coefficient (multiple correlation vith student 
teaching grade) resulting from resubmitting the Darlington order and 
program on the 35 item second stage rating scale data vas .78. This 
vas not significantly different from that of .83 obtained vith RS-112 
data. An ixterestlng facet of this analysis vas that the validity 
coefficient built to .8U with 1^ items. Table 1 presents the validity 
coefficients for each item in RS-33. 
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TABLE 1. VALIOr|!Y COSPPICIERTS OF ITEHS IE R8-3S IN ORDEN OETERNINED 
BY ANALYSIS OP RS-I12 





Or^er Number 


Item Number 


Validity Coefficient 


1 


20 


.67 


2 


31 


.73 


3 


lU 


.80 


k 


17 


.81 


5 


10 


.81 


6 


19 


.81 


7 


22 


.83 


8 


12 


.82 


9 


18 


.8U 


10 


15 


.8U 


11 


27 


.8U 


12 


7 


.8U 


13 


21 


.8U 


Ik 


33 


.8U 


15 


8 


.8U 


16 


13 


.83 


17 


2k 


.83 


18 


25 


.82 


19 


26 


.81 


20 


3k 


.79 


21 


5 


.77 


22 


k 


.76 


23 


1 


.78 


2^ 


2 


.78 


25 


32 


.79 


26 


28 


.81 


27 


U 


.81 


28 


9 


.79 


29 


6 


.80 


30 


23 


.81 


31 


29 


.81 


32 


35 


.81 


33 


3 


.80 


3U 


16 


.79 


35 


30 


.78 
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Stepwise multiple regression analysis using the Ik items that 
were derived from the multiple regression analysis of BS-112 was 
carried out. Five items emerged with a multiple correlation of .85. 
There was no significant difference between the multiple R .89 ob- 
tained for fourteen items from RS-112 data and the .85 obtained from 
the five items trcm RS-35 data. 

An index of validity was secured by correlating the summed 
scale scores for RS-3? with the adjusted and with the unadjusted 
student teaching grades. The resultant correlations were .84 for 
the adjusted grade and .79 for the unadjusted student teaching 
grade. 

Ojoiiiionnaire^^Data. Sixty percent of the college supervisors 
and approximately half of the cooperating teachers reported use of 
a rating scale as the present instrument for measuring a student 
teacher's performance. Some of these used descriptive categories; 
others used descriptive adjectives as superior, good, poor. 

When asked whether their present instrument or RS-35 provided 
a better basis for grading the student teacher, about half of the 
college supervisors and cooperating teachers felt that RS-35 was 
superior. Approximately a third of the cooperating teachers and 
somewhat more than a thir4 of the college supervisors thought that 
both Instruments provided the same basis. 

About a fourth of the cooperating teachers and one third of 
the college supervisors felt that RS-35 took less time to complete 
than the rating scale they were presently using. Approximately 
half felt that RS-35 and the presently used scale took about the 
same length of time to complete, and slightly more than a fifth of 
the cooperating teachers and slightly less than a seventh of the 
college supervisors thought the scale they were presently using 
required less time to complete. The majority of respondents had 
no objection to the amount of time required to fill out the ins- 
trument and noted that 11-20 minutes was enough- 

In general both cooperating teachers and college supervisors 
felt RS-35 and their own scales were equally easy to use. Only 
one fourth of the cooperating teachers and a fifth of the college 
supervisors thought that their own rating scale was easier to use. 

When pressed to make the decision as to which instrument they 
thought better - all things considered - more than half the college 
supervisors and coopezttting teachers said RS-3^ was better. How- 
ever, slightly more than a third of both college supervisors 
cooperating teachers felt that the scale they were using and RS-35 
were of the same quality. 
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Approximately tvo-thirds of the cooperating teachers and col- 
lege supervisors stated that if R6-35 vere available for their con- 
tinued use» they would use it. Approximately one fourth said they 
would use RS-33 but made some qualification. 

Student opinionnaire data indicated that the student teachers 
did not object to the length of time required to fill out the 
rating scale - again for the majority, the time required was 11-20 
minutes. When asked to check their feelings about using R8^33, 
more than half of the student teacher responses pointed up the 
positive and objective basis for looking at performance. About 
one fifth of the responses indicated they felt that it was pro- 
fessionally important, and one fifth of the responses centered 
on the quickness with which the scale could be ccmipleted. Very 
few student responses were negative; those that were indicated 
a dislike for rating self on ai^ instrument. 



DISCUSSION 

As the sanqple in this study was limited to those institutions 
that agreed to participate, the results can only be applied to that 
group. However, it is felt tluit a diversity of colleges and univer- 
sities participated in the study and that the rating scale would 
yield similar results with other samples. 

Critical Incidents . It was noted that the cooperating teacher 
reported the most critical incidents, nearly one half of those re- 
ported. The student teachers submitted somewhat less than one third 
and the college supervisors contributed somewhat less than one fifth. 
The fact that cooperating tegchers contributed the most critical 
incidents can probably be attributed to their being on the Job with 
the student teachers each day and to their continual observation of 
student teachers* behavior. The college supervisors contributed 
fewest critical incidents, very 1: :3ly due to her more limited con- 
tact with the student teacher during the student teaching experience. 

For the total sample, slightly less than half of the reports 
were made of effective behaviors. Perhaps more ineffective behav- 
iors were reported because one has more of a tendency to note and 
report ineffectiveness. This finding however was not true for 
student teachers who reported an equal number of outstandingly 
effective and ineffective behaviors. 

On categorizing the critical incidents the classification with 
the most number of incidents reported was found to be **Adaptation 
to Students* Level". Nearly a tenth of the total number of incidents 
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fell in this category. Of these 88 critical behaviors, about three- 
fourths indicated ineffective behavior. These data indicate a need 
for better acquainting the student teachers with the student they 
will teach. 

Three other categories in the top ten frequency also contained 
a greater number of ineffective than effective behaviors. **Teacher 
Direction and Supervision" ranked third; there was a preponderance 
of five to one ineffective incidents. "Accuracy and Conceptual 
Development of Subject Matter" also included a majority of ineffec- 
tive behaviors reported, as did "Organization of Time: Classrocm". 

Three categories included approximately equal numbers of effec- 
tive and ineffective beha/lors reported. These categories vere, "Use 
of Techniques and Methods", "Ability to Obtain Student Involvement", 
and "Classroom Control". 

Three categories included reports of more effective than ineffec- 
tive behadors. These categories vere: "Behavior in Emergency or 

Unexpected Situations". "Use of Motivation/Interest Approach", and 
"Personal Qualities". 

First Stage Rating Scale. In the distribution of ratings on 
RS-112 the greatest percentage for "no answer" and "no observation" 
cells was reported by the college supervisor. This was probably due 
to the fact that the college supervisor visited the student teacher 
only occasionally and did not, therefore, have enough information 
to make some of the ratings. The cooperating teacher reported the 
next largest number of "no answer" and "no observation" reports, and 
the student teacher the least. This should be expected as the student 
teacher was rating herself, and therefore . she would be in possession 
of more information about her perfonnance as she perceived it. 

The greatest percentage of missing observations was in the item 
describing rapport with parents. This finding may be due to the lack 
of contact of student teachers with parents. O^her missing observa- 
tions were in the categories dealing with performance in supervision 
and classrocm control, care of the department, rapport with faculty, 
extra curricular activities, and professional attitude and Judgment. 
The nature of these categories may account for difficulty in rating. 

The student teacher tended to rate herself higher than did the 
cooperating teacher or the CQllega supervisor. Perhaps student 
teachers were disposed to be generous in their self appraisals as 
they had survived three or more years of academic wprk and any 
screening of candidates for student teaching. If they viewed aca- 
demic achievement and student teaching performance as requiring 
similar effort, the halo effect would carry over. 
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In the item analysis of RS»'11.2 data the mode for the indices 
of discrimination lay in the *20 - *29 range, lo¥ but acceptable. 
Possibly the reason for low discrimination is that many incompetent 
prospective teachers are diverted into other areas before the stu- 
dent teaching experience. Secondly, perhaps student teachers tend 
to be a hcmogeneous group. There v^as not much range in student 
teaching grades betveen he high and low groups. 

The correlation of college supervisor-student teacher ratings 
by inAitution ranged from -.06 to .^9» averaging .22, indicating 
that student teachers viewed their performance differently from the 
college Supervisors. The averaged correlation of cooperating teachers 
and student teachers ratings yielded a coefficient of .29. The 
figure indicated a slightly better agreement of cooperating teacher 
and student teacher than of college supervisor and student teacher. 

Second Stage Rating Scale . Analysis and refinement of RS-112 
yielded the second stage rating scale, RS-3$ with at least one item 
in each of the ten major behavior categories. Again the college 
supervisors reported the greatest number of *'no answer" and "no 
observation". The cooperating tec^her had submitted a lesser per- 
centage and the student teacher the least. This was the same trend 
as had been noted on the RS-112> data. The percentage for "no obser- 
vation" and "no answer" was considerably lower with the second rat- 
ing sca3e for all three groups . This finding can probably be attri- 
buted to the statistical procedure which culled out th.? jnresumably 
less inqportant and lesr frequently seen behaviors. 

The student teacher still tended to rate herself higher than 
did the cooperating teacher or college supervisor. With the RS-3$ 
analysis it Was impossible to determine whether cooperating teachers 
or college supervisors made more use of all levels on the rating 
scale as the percentage for these two groups was very similar; whereas 
in RS-112 the cooperating teachers made most use of all levels. The 
selection of items on RS-33 is probably responsible for this change. 

Comparisons of RS-35 Items with Qualities of "Good" Teachers 
as Reported in the Literatiure . Some c^haracteristics of "good" 
teachers as reported in the literature reviewed for this study 
emerged in the critical incidents, survived the eliminating due to 
statistical analysis of RS-112, and appear in the final version of 
the rating scale. 

Teacher enthusiasm appeared in the second stage rating scale. 
Mastin (19) studied teacher enthusiasm and found that students 
learned more when the teacher was enthusiastic than when she was 
not. Enthusiasm was also one of the qualities that percent of 
school administrators waited in a prospective employee as researched 



"by Rhodes and Peckham (25)* Alny and Sprenson (l) Incorporated this 
characteristic in their rating scale for teachers; they had found 
enthusiasm to be one of the traits contributed most frequently by 
competent educators as an important teacher attribute. Peronto (2U) 
analyzed performance records of social st\idies teachers and found 
that the "good" ones were enthusiastic. 

Patience, described in one item» vas found mentioned in liter- 
ature reviewed only once* It vas among traits contributed most 
frequently by competent educators and therefore, included in the 
Almy-Sorenson Rating Scale for Teachers (1)* 

Peronto (24) mentioned better speaking voice as differentiating 
the "good” from the "poor” teachers he stuped. The item describing 
pronunciatior and use of expressions, and the item ateut pitch oi 
voice might be Considered parts of Peronto 's "better speaking voice" 
category. When Barr (7) summarized studies of teaching efficiency, 
he also included skill in speech. 

Among the findings of Debemardi (U) and Schultz and Ohlsen 
(26) about "good" teachers vas the ability to plan ond organize work. 
Rhodes and Peckham (25) stated that school administrators looked for 
a prospective teacher vho had ability to plan end motivate lessons. 
Perhaps the planning ability mentioned by these investigators vas 
similar to some of the behaviors mentioned in several RS-35 items 
idiich related to lesson plfuming* 

Barr's (7) reference to the ten skills of the "good" teacher 
contained one skill relating to "setting and defining goals" which 
vas described by one of RS-35's items. 

Skill in planning or poking questions vas evaluated by two 
items. This ability vas mentioned in the literature by Barr (6) 
and Peronto (24)* 

Choosing learning experiences vas a skill of the "good" teacher 
according to Barr (7). This vas described by three items in RS-35* 

The planning of evaluative procedures vas the subject of one 
item and vas supported 5n the literature as being important for 
teaching effectiveness by Barr (7)* 

A number of investigators mentioned ability to stimulate inter- 
est or to motivate pupils. Among them were Barr (6; 7)» Schiiltz and 
Ohlsen (26), and Rhodes and Peckham (25)* The motivation factor vas 
assessed by two items. 

Alertness to student needs, measured by another item, vas de- 
lineated hy Barr (7) as an important skill of the teacher, and by 
Debemardi (ll). 
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The Isqportanee of knovledge and choice of auhject matter for 
good teaching vaa stated hy Barr (6), Barr and Emans (31), Deher- 
nardl (11), and Peronto (2if)« Several Items assessed wcme of the 
behaviors In this area. 

Instructional skill as mentioned hy Barr (7) and Barr and Emans 
(3i) and professional competence as reported hy Rhodes and Peckham 
(2^) as attributes of **good'^ teachers vere broad terns, and numerous 
items or perhaps most Items might be construed as fitting under 
these categories^ 

Involving pupils In classroom activities as mentioned by Schultz 
and Ohlsen (26) vas measured by one Item In RS»33. 

Only one Item measured classroom control; this teacher behav- 
ior vas found to be one of the 10 most frequently Incorporated In 
teacher rating scales according to Barr and Emans (31), and vas 
mentioned by Debemardi (U)« 

Care of the department vas the subject of ope Item* Barr and 
Shkins (31) noted that this vas one of the categories frequently 
found In teacher rating sqales* 

Pupil-teacher rapport charaeterlaed the "good” teacher as Indi- 
cated by Barr (7), Debemardi (ll), Rhodes and Peckham {2^^) and Hearn 
(U) If rapport can be equated vJth Interest In pupils. Tvo rating 
scale Items measured teacher-pupil rapport* 

Prcf^E^slonal attitude vas cessed hy an item In RS-33 and vas 
be .ibipoi^i^ant by Debemardi (ll). It vas also found as one 
cf Che ten xiost frequently described categories In teacher rating 
(scales as analyzed by Barr and i^nans (31). 

Ideas In the rating scale Items found to be significant for the 
evaluation of effectiveness In student teaching that vere not supjxorted 
by the literature revleved Included: provision of meaningful exper- 

iences and change of pace; making available ne'^^ssary and appropriate 
materials; provision of experiences to facilitate transfer of learning; 
control of movement of lesson; ability to assume full class load; 
preparation of assignments for students vlthout vork; reliance on 
cooperating teacher; assistance and participation In extra curricular 
activities* 

Some characteristics of "goo^" teachers vere reported In the 
literature but did not appear In the second stage rating scale* Barr 
(6) and Hearn (ll) thought provision for Individual differences an 
Important characteristic of the "good” teacher. Although this behavior 
did not emerge In the socond stage rating scale, the Idea appeared 
in three Items of the orlginid Instrument. 
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Barr (6) described a **good” teacher as requiring a notebook and 
outside reading. The keeping of a notebook tends to be a bit passi 

at present 2 and the concept of outside reading appeared in no critical 
incidents in this study. 

The objective teacher characteristic of standing most of the 

period as noted by Barr (6) did not appear in any of the present 
research. 

The concept of attractiveness or appearance was mentioned by 
Charters and Waplos (12) and Barr and Qnans (3l). The study by 
Charters and Waples concentratfi on sjuch characteristics as care- 
fulness, haith, ppenmlndedness , promptness, refinement, and thrift. 
Barr and Emans (3l) also listed health. Since these are teacher 
characteristics and not behaviors involved in the teaching process, 
the concepts did not emerge in the present study. 

Personal habits, a rather nebulous designation, was enumerated 
by Barr and Emans (31) as a characteristic of a **good” teacher. 

Since it is difficult to know what these two men had in mind, it is 
impossible to seek comparisons in this research. 



Hatcher (l8) found that **good” teachers cooperated extensively 
with other departments and gained the cooperation of other faculty 
members in their programs. While this idea did not emerge in the 
second stage rating scale, it appeared in the initial set of items. 

Itoowledge of mental hygiene was listed as an Important charac- 
teristic of the good teacher by Peronto (2l»). This teacher char- 
acteristic, not behavior, was not enumerated in the present research. 



CONCLUSIONS 

This study has demonstrated that a valid, reliable useful 
rating scale has been constructed for the evaluation of student 
teaching in home economics. Critical incidents collected from a 
diverse sample of cooperating teachers, college supervisors, and 
student teachers provided an TOlJ^ical base for construction of 
the rating scale. 



- analysis of a second stage rating scale, 

RB-35* yielded almost completely satisfactory cross validation of 
resets of RS-112, the first rating scale constructed. The overall 
split-half figures for the two scales were in the 90 *s indicating 
a nirb degree of consistency with Tdiich any participant rated. 
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Item discrimination indices for RSr3$ were better than those 
for RS-112, but the m^e for both instruments vas the i20 to .29 
range. Perhaps the discrimination indices vere low because poten- 
tially low student teachers vere directed out of the program 
before the student teaching experience. 

Inter-rater reliability coefficients vere not significantly 
different for RS-33 and RS-112. Although one vould like to see 
>igreement higher » .39 and .63 are not unacceptable figures. 

The second stage scale vas said to require generally less than 
tventy minutes to complete. Approximately two thirds of cooperating 
teachers and college supervisors said that they would use the scale 
if it vere available for continued use. Approximately one fourth 
made some qualifications about continued use of the scale. 

The method followed in the development of the scale certainly 
has merit and could beneficially be adopted by rating scale con- 
structors in other fields and for other purposes. 

The diverse types and geographical locations of institutions 
in the sample, and satisfactory cross validation suggest the findings 
of the study would probably apply to the evcduation of most home 
economics stxident teachers. 



SUMMARY 

The primary concern of this study vas the construction of a 
rating scale for the evaluation of student teaching in home economics, 
in particular those aspects that are critical in student teaching 
performance. The problem vas approached by means of the critical 
incident technique. Five hundred and titty usable reports of out- 
standingly effective or ineffective student teacher behaviors (9^3 
critical behaviors) vere collected from college supervisors, coop- 
erating teachers and student teachers; these vere categorised 
according to specific behaviors, and a randomly selected 13 percent 
of the data submitted to a panel of home economics teacher educators 
to check agreonent in classification. The researchers and the panel 
vere in agreement 66 percent of the time. 

The critical behaviors provided a valid base for a rating scale; 
they vere used to describe levels of quality on a five point continuum 
of 112 unidimensional items. Tested in the field by a self selected 
sample, percentage distribution of 399 ratings shoved 70 percent in 
levels ^ and 3. Cooperating teachers made the greatest use of all 
levels. Student teachers tended to rate themselves higher than either 
cooperating teachers or college supervisors rated them. 
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Reliability of the instrument was tested by inter-rater, split- 
half, and item discrimination methods. The inter-rater correlation 
stepped-up by use of the Spearman-Brovn prophecy formula was »39 for 
123 matching college supervisor-cooperating teacher pairs. The 
stepped-up split-half reliability coefficient was .96 for I9U graders - 
those responsible for giving the student tecushing grade. Item 
discrimination analysis shoved a modal response or 71 percent of the 
indices in the .20 to .29 category - low but acceptable. 

The major analysis of the first stage rating scale was the 
Darling won program vhich involved partial correlations and validity 
coefficients (multiple correlation with the criterion, student teach- 
ing grade). The grader data (N«19^) had student teaching grades 
adjusted on the basis of mean ratings of the institution in vhich 
they occurred as compared vith the overall mean ratings, to take care 
of the variance of grades among institutions. The resulting validity 
coefficient vas .85 for 35 items. When the Darlington program vas 
run vith the 35 items and the student teaching grades as giv&n, the 
resulting coefficient vas .83. 

A stepvlse multiple regression analysis vas carried out vhich 
yielded fourteen significant items vith a multiple R of .87. Ten of 
these items had appeared fjrom the Darlington analysis; the four not 
so generated vere Included vith the 35 items in order to examine 
their cross validation performance. 

The second stage rating scales and opinionnaires vere administered 
to a self selected and Independent sample. By the deadline for data 
collection, 276 rating scales had been returned. The same analysis as 
delineated for RS-112 vas carried out. The difference betveen corre- 
lation coefficients vas tested using Fisher's z. 

The greatest percentage of ratings fell in level U vhile levels 
3 and 5 tended to be almost equal, indicating RS-35 bed a better dis- 
tribution of ratings than RS-112. As vith RS-112, stuaent teachers 
tended to rate themselves higher than either cooperating teachers or 
college supervisors rated them. With RS-35 no one group could be said 
to have made use of all scale levels. 

The stepped-up inter-rater coefficient for RS-35 vas .65, not 
significantly different from the .59 of RS-112. The stepped-up 
split-half reliability correlation vas .96 vhich vas significantly 
different at the one percent level from .98 for RS-112. Hovever, 
the Internal cross validation of RS-35 split-half correlation yielded 
results not significantly different from those of the vhole group. 

Item discrimination vas better than that found for RS-112, but the 
mode vas still in the .20 to .29 range. 
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Running the Darlington program on RS-3^ data yielded a validity 
coefficient of .78 vhich vaa not aignificantly different from the 
figure of .8^ for RS-112. Hovever, the figure of .73 ^ich resulted 
from use of the unadjusted student teaching grades vas significantly 
different at the five percent level from .83 for RS-112. 

When the l4 items generated from the stepwise regression ancd* 
ysis of RS-112 were used in the analysis of RS-3^» five of these items 
emerged with a multiple correlation of .8^ as compared to .87 with 
the 1^ items in RS-112 analysis. 

An index of validity vas secured hy correlating the summed scale 
scores for RS-3^ with the adjusted student teaching grade which 
yielded a correlation of .8^. When the same procedure vas followed 
with the unadjusted student teaching grades, the correlation vas ^.79* 

The majority of cooperating teachers and college supervisors 
reported use of a rating scale to evaluate student teaching. About 
half of the supervisors and cooperating teachers felt that RS-39 
provided a better basis for grading and somewhat more than half 
thought it had better all around qualities than their presently used 
instrument. Most ^o used the scale found that it took no longer 
than twenty minutes to complete; they did not object to the time 
required for its use. When queried about continued use of the rating 
scsde if it were available, approximately two thirds of cooperating 
teachers and college supervisors said they would use it, and approx- 
imately one fourth gave their qualified assent. More than half the 
student tecushers considered that the objective basis RS-3^ provided 
for looking at their behavior vas important. Approximately a fifth 
thought it professionally important, and another 20 percent stressed 
the ease and quickness with which RS-39 can be used. Negative student 
teacher reactions were very few. 

Recommendations 

Since RS-39 performed quite satisfactorily, it deserves further 
refinement and use. If a short rating scale is desired, for cm almost 
equivalent correlation with student teaching grade, the five items 
emerging from the multiple regression analysis are recommended. (See 
items Ih, 28, 31, 3^, 39 in RS-35, appended.) 

Should anyone be interested in a further refinement, the addi- 
tion of a descriptive level between three and five would be beneficicd 
since the upper end of the scale received extensive use. Possibly 
the rating system could be changed to one through ten, to enable the 
raters to differentiate more at the top of the scale. This would not 
be recommended, however, unless the additional descriptive level sug- 
gested were added. 
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A place should be made on the rating scale for ccmments by the 
rater regarding grovth of the student teacher. This is seen as a 
neans of making the evaluation more personal. 

Regarding the use of the rating scede, its recommended use is 
at the middle and conclusion of the student teaching period by all 
involved in the program. The scale shoiild be used independently by 
the college supervisor, cooperating teacher, and student teacher and 
should be followed by a conference so that the student \^eacher's abil- 
ity and progress can be eva^^ated effectively. Use at the recommended 
two times would necessitate a second column for rating. Use at the 
suggested times is not meant to preclude evaluation throughout the 
student teaching period. 

A page of pertinent information on the development of the rating 
scale hsbeen written. (See Appendix.) Acccmpanying the scale, this 
will give users seme concept of the basis for its construction and of 
its quality. 

Suggestions for further study arising from this research are: 

1. to determine ways to promote growth in the student teacher in the 
areas emerging in the scale and espacially in the five areas re- 
sulting from the multiple regression analysis. 

2. to train raters (possibly by analysis of a film of student teacher 
performance) in the use of the scale in an attempt to bring about 
better agreement. 

3. to develop other means of evaluating student teaching performance, 
e.g., use the same procedure to develop a rating scale to be used 
by pupils to evcdiaate the student teachers and compare results 
with those from this scale. 

U. to use the scale in the evaluation of teaching performance of 
first year teachers. 

5* to develop another scale for the evaluation of first year home 
economics teachers following thipi methodology. 
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GLOSSARY 



Student Teacher - a college or university student who teaches and 

carries out associated duties in a teaching cen- 
ter affiliated with the college; occasionally 
abbreviated as ST 



Cooperating teac her or supervising teacher - a school staff member 

who assists the college or university by directing 
the student teacher in her student teaching exper— 
ience; abbreviated as CT 



College supervisor - the university staff member who visits the 

student teacher in the student teaching center 
and who with the cooperating teacher guides and 
evaluates the student teacher’s performance; 
abbreviated as CS 



Student teaching center - a school which provides its facilities 

for the student teaching experience 



Critical incident - a critical incident involves the description 

of a student teacher’s behavior that has re- 
sulted in her being either outstandingly 
effective or Ineffective during the period of 
practice in which she gradually assumes the 
role of teacher 



Grader group - composed of all individuals who were responsible 

for grading student teachers 



First stage rati ng scetle - Original rating sccde of 112 items; 

abbreviated as RS-112 



Second stage rating scale - Revised instrument containing 33 items 

plus four; abbreviated as RS-35 
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INSTITUTIONS PARTICIPATING IN SOME OR ALL 
OF THREE DATA COLLECTIONS 



Connecticut 

Saint Joseph College 

University of Connecticut 

Indiana 

Ball State University 

Butler University 

Goshen College 

Indiana University 

St. Mary-of-the-Woods College 

Valparaiso University 

Kentucky 
Berea College 
Kentucky State College 
Nbrehead State University 
Western Kentucky University 

Maine 

University of Maine 
Massachusetts 

State College at Framingham 
University of Massachusetts 

Nev Hampshire 

Keene Teachers College 

Nev Jersey 

Douglass College (Rutgers Univ.) 
Montclair State College 

Nev York 

Cornell University 
Hunter College 
State University College, 
Plattsburgh 
Syracuse University 



Ohio 

Ashlemd College 

Bluffton College 

Kent State University 

Notre Dame College 

Our Lady of Cincinnati College 

University of Dayton 

Ursuline College for Women 

Pennsylvania 

Albright College 

Carnegie Institute of Technology 

Cheyney State College 

Indiana University 

Juniata College 

Mansfield State College 

Maryvood College 

Mercyhurst College 

Pennsylvania State University 

Seton Hill College 

Rhode Island 

University of Rhode Island 
Tennesse e 

David Lipscomb College 
Memphis State University 
Middle Tennessee State University 
Tennessee A&I State University 
University of Tennessee, 

Martin Branch 

Vermont 

University of Vermont 

West Virginia 
Marshall University 
West Virginia University 



Each data collection involved a different group of student teachers; 
hence they may >^e considered independent samples. 
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LETTER TO PROSPECTIVE PARTICIPANTS 



Home Economics Education 
Martha Van Rensselaer Hall 
Cornell University 
Ithaca, New York lUS^O 



Dear 



I am asking cooperating teachers, college supervisors, and student 
teachers to assist in research that is being conducted as a part of a 
larger project for the Ph»D» degree* Approval for your cooperation has 
been secured from the head of the home economics division at the college 
or university with idiich you aze connected* The study will ultimately 
be concerned with the construction of an instrument to evaluate student 
teaching in home economics* For the initial phase of this research, a 
study of the requirements of student teachers* behavior will be collected* 
The description of the technique that will be used in this study is: 



A critical incident involves the description of a 
student teacher's behavior that has resulted in her 
being either outstandingly effective or ineffective in 
a period of practice in which she has the opportunity 
of Integrating the various learnings of her professional 
preparation and thereby increasing her perspective of 
the total role of a home econcmics teacher* Some of the 
areas in which this behavior might occur are: 



1* performance in the classroom 
2* preparation for teaching 
3o relationships with faculty 
relationships with pcurents 
relationships with cooperating teachers 
6* extra curricular activities 
7* duties associated with teaching 



Your assistance is vitally needed in determining these incidents* 

You are asked to supply the names of the persons involved in the situation 
so that the critical incidents that are described by the cooperating 
teacher, college supervisor, and student teacher mey be matched* All 
data will be coded and handled confidentially; no names will be used in 

the report of the study* This study is to continue during the month of 
May* 
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Please describe the situation in terms of what was actually observed 
or done rather than inferred from what you saw or did. Do not states for 
example, that "the lesson p.lans of the student teacher. Miss Jones, the 
student teacher, did not orgpnize the content logically, lea'ning exper- 
iences were not varied, objectives vers not stated in behavioral terms, 
etc.** Another situation reported in terms of inferred behavior is: 

"I (the student teacher reporting) seem to favor Susan." This could 
better be described by stating: "I call upon Susan almost every time 

she raises her hand even when others volunteer." 

Try to give as detailed a description as possible of the behavior 
rather than listing generalized traits or abstractions. Attempt to be 
objective and unbiased in your reporting. 

Incidents of outstandingly effective and ineffective student teacher 
behavior should be reported as they occur. You will probably not find 
them occurring at any regular interval. Remember, the incident must in- 
volve some specific aspect of the student teacher's behavior that makes 
her outstandingly effective or ineffective in student teaching. Please 
reix)rt the situations without conferring vith the other individuals 
involved. 



The attached sheets contain a sample of a type of incident that you 
may be contributing this month. The format will remain the same as you 
supply Information about the situations you observe. Please become fam- 
iliar with ths information sheet and fill it in soon as an incident 
involving effective and ineffective student teaching behavior occurs. Do 
not wait. Each time an incident occurs, record the incident at once, *nd 
do not go back and chemge any of your statonents later. When you have 
four or five incidents recorded, send them in the stamped, self addressed 
envelope provided to: 

Miss Joan Gritzmacher 
Home Economics Education 
Martha Van Rensselaer 
Cornell University 
Ithaca, New York 

If you should need additional forms for recording the data, you 
may use a sheet of paper euid number the items or request additional 
forms from me at the above address. 

I will be most appreciative of your cooperation in this study. 

Sincerely yours. 



Enc. 



o 



B-2 



MATERIALS FOR COLLECTING 
CRITICAL INCIDENTS 
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SAMPLE CRITICAL INCIDENT OF STUDENT TEACHER'S BEHAVIOR 

Description-definition of critical incidents A critical incident in- 
volves the description of the student teacher's behavior that has re- 
sulted in her being either outstandingly effective or ineffective d'orlng 

the period of practice in which she graduaiUy assunes the role of teacher. 

Some of the areas in which this behavior might occur are: 

lo performance in the classrocm 

2. preparation for teaching 

3. relationships with faculty 
U. relationships with parents 

3* relationships with cooperating teacher 
6. extra curricular activities 
7* duties associated with teaching 

!• Name the people Involved in the situation. Place an X before your 
name . 

College Supervisor Mrs. Ready 
Cooperating Teache r Mrs, Childs 
X Student Teache r Miss Rider 
Other (Specify) 

2. Date the situation occurre d 2/2U/65 

3. School in which the situation took plac e Horac e^j faj^IL jS jj flL^SiiSS]^— 

U, Description of the situatio n I gave the 7th grade class a p ost 
test on their unit on Personal Relationships and found after 
yzlng the test that the pupils had not learned the content in a 

few Mreas. As a res ult I changed my lesson plcuis for th e r ^>t 

day (with the approval of Mrs. Childs) and retaught the content 
that the pupils apparently had not understood. 

5* What caused the situation? Inability to recognize that I was not 
getting through to the pupils. 

6. Why do you think that this behavior indicates effectiveness or 
ineffectiveness? When I realized that the p upils had done poorly 
in certain areas of the test. I decided that since this learn*»^g 
was important and since I had evidently not done a good job. ^hgd 
better reteach the content that they had not understood in anoi^ hy 
way. I wanted the pupils to attain the objectives that had been 
formulated for this unit. " 



7* What action did the people named in item 1 take as a result of the 
inc Ident ?^ W hen I gave the pupils a test on their beginning foods 
work at th^ of the next week. I added a small section on the 
Personal Rel{a.tiu.tish i ps conte nt that they had done so p ooi^"^ on be- 
fore. This time they did much better . ' 



SAMPLE CRITICAL INCIDENT OF STUDENT TEACHER'S BEHAVIOR 



Description-definition of critical incident; A critical incident in- 
volves the description of a student teacher's behavior that has resulted 
in her being either outstandingly effective or ineffective during the 
period of practice in which she gradually assumes the role of teeoher* 
Some of the areas in which this behavior might occur are: 



!• performance in the classroom 
2. preparation for teaching 
3« relationships with faculty 
k, relationships with parents 
3* relationships with cooperating teachers 
6* extra ciurriculcur activities 
7* duties associated with teaching 



1 . 



Name the people Involved in the situation. Place an X before your 
name. 

College Superviso r Miss Green 
X Cooperating Teache r Mias Smith 
Student Teache r Miss Brown 
Other (Specify) 



2. Date the situation occiurre d 2/15/65 

3. School in which the situation took plac e Gresley High School 

i*. Description of the situatio n Miss Brown was ignoring some of the 
V^^H_habits of the pupils in her 7 ^H_grad^e^^QQde_clAaa^^^^^^The^^^^iw>^^ 
not choo sing the proper measiging tools and were not sifting and 
leveling the flour correctly. Dishes and utensils in some cases 
were washed in dirty dishwater and equipment was occasionally put 
away in the wrong places. - ^ — 

5. What caused the situation? Miss Brown had not reviewed with the 

pupils procedures and stemdards before the lab. She had not con- 
sidered w ork habits when writing ob.1ectives« etnd evaluative pi<*^ps 
included only analysis of the final product and a paper-and-pencil 
test mainly on principles. ^ f n i , r , 

6. Why do you think that this behavior indicates effectiveness or 
Ipcffectiveneas? Full use of learning experiences is not 

place. Work standards are important aspects involved in food ^ 
preparation. i i ■ ■ 

7. What action did the people named in item 1 take as a result of the 
inc ident? Miss Brown and I discussed the laboratory as well as other 
classes in a conference after school. As a result Miss 

to have the pupils evaluate their learnings with emphasis on work 
habits during t^e next class session. 
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H9F0BM/VTI0N SHEET FOR REPORTING CRITICAL INCIDENTS OF 
STUDENT TEACHER'S BEHAVIOR 



Descrlptlon-definition of critical incident: A critical incident in- 
volves the description of a student teacher's behavior that has re- 
sulted in her being either outstandingly effective or ineffective 
during the period of practice in which she gr€uiual][y assumes the role 
of teacher* These incidents may be rather i ac illustrated by 
the example Involving evaluation of student learnings in the unit on 
Personal Relationships or relatively broad as illustrated by the 
example Involving work habits of pupils in foods lab. Some of the 
areas in which this behavior might occur are: 

1* performance in the classroom 
2* preparation for teaching 
3* relationships with faculty 
k, relationships with pcurents 

relationships with 'cooperating teacher 
6* extra curricular activities 
7* duties associated with teaching 



1 . 



Name the people involved in the situation. Place an X before your 
name . 

College Superviso r 
Cooperating Teache r 
Student Teache r 
Other (Specify) 



2. Date the situation occurred 

" " " " ' I a I ii y 



3* School in idiich the situation occurred^ 
k. Description of the situation 



What caused the situation? 



6. Why do you think that this behavior indicates effectiveness or 
ineffectiveness? 



7* What action did the people named in item 1 take as a result of the 
inc Ident ? 
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SAMPLE ANALYSES OF CRITICAL INCIDENTS 
OF STUDENT TEACHER BEHAVIOR 



Incident #270 



Reporter: ST 



If. 



A&WWASkVWXj W^«kBD*^tMUV4AO V* «bVAW4A VA ~ 



m«»Am 





^ 1m « 

yii% 



preparing of Swiss Steak. I was extremely nervous and uncer- 
tain - time ran out and the Swiss Steak was not even in the 
oven. The vegetables were never prepared as a demonstration. 
The planned student-teacher time table for procedure the fol- 
lowing day did not even get started. 



3* I have never prepared Swiss Steak before and had not rehearsed 
the dmnonstration, therefore I was very unsurr of what steps 
to take, how to do it, and whether or not it v^s coming out 
right. 

6. The students I know sensed this uneasiness on my part and so 
were inattentive. Demonstrations should be examples of "how 
to" as well as why and with a poor model it is impossible to 
know what to do on your own. 

7. Luckily I was able to repeat the same lab demonstration that 
afternoon on another class. This time it moved much more 
smoothly - also as a result of a brief noon conference with 
the teacher. The next day I simply outlined the procedure for 
the day - no student-teacher cooperated time plan was made. 



Incident #326 

Reporter: CT 

4. ST was aghast at her first 7th grade foods lab. It didn't 
take her long to see that she had overestimated their ability 
and underestimated the time in idiich they could do their work. 

3. So much to do and only 30 min. in which to do it. 

6. This problem is one that all foods teachers have. Everyone 
has to learn how to handle the individual situation. ST 
realized what was wrong and took care. 

7 • ST quickly made a lab sheet that was an excellent guide and 
a big help for the students. 



D-1 





D-2 




















CATEGORIES OF STUDENT TEACHER BEHAVIOR DERIVED FROM CRITICAL IKCIDENTS 



!• PersoneJ. QueJ.it ies 

a. voice 
b« poise 

c. assimption of responsibility 
2. Lesson Pleuuiing 

a. utilization of knowledge of students* abilities, background 
b« lesson goals 

c« structuring of subject-oatter (concepts, generalizations) 

d. preparation of questions to develop students* ability 
to think 

e. learning experiences 

f« teaching aids and materials 
g« evaluation of pupils 
3* Functioning in a Classroom Situation 

Sk* use of motivation/interest approach 

b. adaptation to students* level 

c« accuracy and conceptual development of subject matter 
d. use of techniques and methods 
e« use of teaching aids and materials 

f. use of review 

g« use of summary (drawing together of generalizations) 
h« application of learnings to situations outside the school 
!• asking questions and handling pupils* questions 
J. evaluation of students* progress 
4* Ability to Obtain Student Involv^ent 
3* Organization of Time 

a. classroom 

b« student teacher work en toto 

c. pre class preparation of aids, materials, etc. 

6. Alertness to Classroom Activity 

a. teacher direction and supervision 

b. classroom control 

c. behavior in emergency or unexpected situation 

7. Care of the Department 

8. Inter-personal Relationships 

a. rapport with cooperating teacher, faculty 

b. rapport with students 

c. rapport with parents 
9* Extra CurrlciJ.ar Activities 

10. Professional Qualities 

a. professional attitude and Judgment 

b. professional growth 
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Your Nam e 

Student Teacher's Name 
Institution 



COLLEGE SUPERVISOR AHP COOPERATING TEACHER OPINIONNAIRE ABOUT THE 
RATING SCALE 



Please place an "X” in the appropriate blanks* 



I The length of time required to fill out this rating scale vas 



5. 


39* 


1. 10 minutes or less 




21 


2« 11-20 minutes 


23, 


16 


3. 21-30 minutes 


16 , 


3 


h, 31-Ud minutes 


2. 


0 


Ul-^0 minutes 


2. 


1 


6. ^1 minutes - 1 hour 


3. 


0 


7« more than 1 hour 



II Regarding the amount of time required to fill out the ins- 



trument, I 






93, 


7U 


1. have no objection to the amount 
of time used* 


5. 


6 


2. feel that the scale is too time 
consuming* 



III What was the reaction of this student teacher to filling out 
her scale? (Do not inquire , but student teacher may have made 
some comments voluntarily.) 

I don't knov 

As far as I know she took it in 
stride and completed it. 

She said that she wasn't going 
to do it. 

U. She was agreeable about helping 
with the research. 

She complained about the time 
involved. 

6. Other— please explain 



16 , 


U7 


1. 


38, 


13 


2. 


0, 


2 


3. 


27. 


15 


u. 


1. 


0 


5. 


9. 


2 


6. 



IV What kind of instrument do you presently use for evaluating a 
student teacher's performance? 

0, 9 1. none 

U8, k3 2. rating scale with descriptive 

categories like this one 

22, 1 . 3. rating scale with descriptive 

adjectives, like superior, good, 
poor 

10, 3 U. check list 

27 5* other— please specif y 

"first number indicates frequency of CT response to item; second 
number indicates frequency of CS response 




H-1 



V If you use an instrument to evaluate your student teachers* 
hov does it compare with this rating scale? 

Which instrument requires a shorter length of time to complete? 



22* 


11 


1. 


the one we are presently using 


23* 


25 


2 . 


this rating scale 


51* 


35 


3. 


both require about the same 








amount of time 



Which instrument is easier to use? 

26 , 15 ____ !• the one we are presently using 

35* 28 _____ 2, this rating scale 

28 3. both are equally easy to use 

Which instrument provides a better basis for grading the stu- 
dent teacher? 

11| 6 1. the one we are presently using 

53* 3^ 2. this rating scale 

29 ___ 3. both provide the same basis 

Which instrument is better— all things considered? 

17 * 8 1, the one we are presently using 

U 5 * 33 2 . this rating scale 

35 » 22 ___ 3. both are of the same quality 

VI If this scale were available for your continued use* would you 
use it? 

67 * 52 1. Yes 

7* 6 2. No 

____ 3. Perhaps— please state qualification 



PLEASE FEEL FREE TO MAKE COMMENTS ABOUT THE RATING SCALE ON THE 
OTHER SIDE OF THIS SHEET. 



If you were goina to rate thir student teacher independently* what 
number grade would you give her using the following scede: Please 
write that number on the line. 



NOTE: 



90-100 

80-89 

70-79 

60-69 

below 60 



superior 
better than 
average 
poor 
failure 



average 



If you supervise more thou one stxident teacher* it is not 
necetsary to fill out the complete form for each one. Fill 
out the entire opinionnaire for the first student teacher and 
the top of the sheet and items 1* 2* and 3 and the last item 
regarding the grade for subsequent student teachers. 
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Nam e 

Institutio n 

STUDENT TEACHER OPINIONNAIBE ABOUT THE RATING SCALE 
Please place an "X” in the appropriate blanks. 



I 



The length of time required to fill out this rating scale was 



10» 


1. 10 minutes or less 




__ _ 2. 11-20 minutes 


21 


3. 21-30 minutes 


k 


U. 31-40 minutes 


2 


3. 41-50 minutes 


3 


6. 31 minutes - 1 hour 


0 ' 


7. more than 1 hour 



II Regarding the amount of time required tp fill out the instru- 
ment! I 

87 1. have no objections to the amount 

of time used. 

2 ______ 2. feel that the scale is too time 

consuming. 



Ill Hy feeling toward evaluating myself using this rating scale 
was 



positive because negative because 



75 


it gave me an objective 
basis for looking at my 
performance 


8 


I hate to rate my- 
self on anything 


26 


it seemed important 
professionally 


2 


it seemed unim- 
port ant 


23 


I could do it quite 
quickly 


2 


it took so much 
time 



IV Although your cooperating teacher and college supervisor did 
not see the ratings you made on yoiirself , would you have been 
willing to rate yourself if they had locked at your self 
evaluation? 

87 Yes 

2 No 

PLEASE FEEL FREE TO MAKE COMMENTS ABOUT THE RATING SCALE BELOW AND/OR 

ON THE OTHER SIDE OF THIS SHEET. 

* number indicates frequency of ST response to item 



I-l 



INFORMATION DESIGNED TO ACCOMPANY RATING SCALE IN FUTURE USE 



Develoment of the rating ecalet The items in this rating scale 
vere based on 9S5**crltlcal behaviors of student teachers reported 
independently by college supervisors » cooperating teachers » and 
student teachers* The critical behaviors vere then categorized and 
utilized in constructing a first rating scale vhlch consisted of 
112 unldlmenslonal items. The instrument vas used Independently 
by the three groups at the conclusion of studept teaching. Three 
hundred and ninety-nine usable scales vere returned end analyzed 
statistically* The major analysis sought to determine vhlch 
items would significantly predict the student teaching grade; 33 
items emerged from this analysis. These items then constituted the 
second stage rating scale; and the Instnment vas administered to 
a second sample* Two hundred and seventy-six rating scales vere 
returned. Cross validation vas quite satisfactory; some of the 
results of analysis of the 33 items scale arenas follovs: 





split-half reliability 


*96 




inter-rater reliability (college 
supervisor and cooperating 
teacher) 


*65 




validity coefficient (multiple 
correlation of 33 items and 
adjusted student teaching grades 
[to control for institutional 
variation]) 


.78 




ditto vith student teaching grade 
as given 


*73 




correlation of summed scale scores 
vith adjusted student teaching 
grade 


.8U 




ditto vith student teaching grade 
as given 


*79 




time to complete the insturment: 
less than 20 minutes 


6k% of par< 
tlclpants 


For further information on the Instrument » see Grltzmacher» 
Joan E. "Evaluation of Strident Teaching in Home Econcmlcs'% 
Ph.D. Dissertation, Ithaca, Nev York: Cornell University, 



1967. 
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